
RAW SEQUENCE LISTING 
PATENT APPLICATION: US/09/995,860 

Input Set : A:\SEQUENCE LSITING.txt 
Output Set: N:\CRF4\08092002\I995860.raw 



DATE: 08/09/2002 
TiME: 14:42:30 



and 



C--> 



3 
5 

6 
8 
10 
12 
12 
14 
16 
18 
19 
20 
22 
24 
27 
29 
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31 
33 
35 
37 
40 
42 
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<110> APPLICANT: Innogenetics N.V. 

<120> TITLE OF INVENTION: Purified hepatitis C virus envelope proteins for diagnostic 

therapeutic use. 
<130> FILE REFERENCE: 2551-69 

<140> CURRENT APPLICATION NUMBER: 09/995,860 
<141> CURRENT FILING DATE: 2001-11-29 

<160> NUMBER OF SEQ ID NOS : 122 
<170> SOFTWARE: Patentln 3.1 
<210> SEQ ID NO: 1 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Hepatitis C virus 
<400> SEQUENCE: 1 

ggcatgcaag cttaattaat t 21 
<210> SEQ ID NO: 2 
<211> LENGTH: 68 
<212> TYPE: DNA 

<213> ORGANISM: Hepatitis C virus 
<400> SEQUENCE: 2 

ccggggaggc ctgcacgtga tcgagggcag acaccatcac caccatcact aatagttaat 60 
taactgca 68 
<210> SEQ ID NO: 3 
<211> LENGTH: 642 
<212> TYPE: DNA 

<213> ORGANISM: Hepatitis C virus 
<220> FEATURE: 
<2 21> NAME/KEY: CDS 
<222> LOCATION: 1..639 
<220> FEATURE: 
<221> NAME/KEY 
<222> LOCATION 
<400> SEQUENCE 



mat_peptide 
1. .636 
3 

atg ccc ggt tgc tct ttc tct ate ttc etc ttg get tta 
Met Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 

15 10 
ctg acc att cca get tec get tat gag gtg cgc aac gtg 
Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

20 25 
tac cat gtc acg aac gac tgc tec aac tea age att gtg 
Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
35 40 45 

gcg gac atg ate atg cac acc ccc ggg tgc gtg ccc tgc 
Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 



ctg tec tgt 
Leu Ser Cys 
15 

tec ggg atg 
Ser Gly Met 
30 

tat gag gca 
Tyr Glu Ala 

gtt egg gag 
Val Arg Glu 



48 



96 



144 



192 
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71 


50 










55 








60 












73 aac 


aac 


tct 


tec 


cgc 


tgc 


tgg 


gta 


gcg 


etc 


ace ccc 


acg 


etc 


gca 


get 


240 


74 Asn 


Asn 


Ser 


Ser 


Arg 


Cys 


Trp 


Val 


Ala 


Leu 


Thr Pro 


Thr 


Leu 


Ala 


Ala 




75 65 










70 










75 








80 




77 agg 


aac 


gee 


age 


gtc 


ccc 


ace 


acg 


aca 


ata 


cga cgc 


cac 


gtc 


gat 


ttg 


288 


78 Arg 


Asn 


Ala 


Ser 


Val 


Pro 


Thr 


Thr 


Thr 


He 


Arg Arg 


His 


Val 


Asp 


Leu 




79 








85 










90 








95 






81 etc 


gtt 


ggg 


gcg 


get 


get 


etc 


tgt 


tec 


get 


atg tac 


gtg 


ggg 


gat 


etc 


336 


82 Leu 


Val 


Gly 


Ala 


Ala 


Ala 


Leu 


Cys 


Ser 


Ala 


Met Tyr 


Val 


Gly 


Asp 


Leu 




83 






100 










105 








110 








85 tgc 


gga 


tct 


gtc 


ttc 


etc 


gtc 


tec 


cag 


ctg 


ttc ace 


ate 


teg 


cct 


cgc 


384 


86 Cys 


Gly 


Ser 


Val 


Phe 


Leu 


Val 


Ser 


Gin 


Leu 


Phe Thr 


He 


Ser 


Pro 


Arg 




87 




115 










120 








125 










89 cqq 


cat 


qaq 

j zj 


acg 


gtg 


cag 


gac 


tgc 


aat 


tgc 


tea ate 


tat 


ccc 


ggc 


cac 


432 


90 Arg 


His 


Glu 


Thr 


Val 


Gin 


Asp 


Cys 


Asn 


Cys 


Ser He 


Tyr 


Pro 


Gly 


His 




91 


130 










135 








140 












93 ata 


aca 


qqt 


cac 


cgt 


atg 


get 


tgg 


gat 


atg 


atg atg 


aac 


tgg 


teg 


cct 


480 


94 He 


Thr 


Gly 


His 


Arg 


Met 


Ala 


Trp 


Asp 


Met 


Met Met 


Asn 


Trp 


Ser 


Pro 




95 145 










150 










155 








160 




97 aca 


acg 


qcc 


ctq 


gtg 


gta 


tcq 


cag 


ctg 


etc 


egg ate 


cca 


caa 


get 


gtc 


528 


98 Thr 


Thr 


Ala 


Leu 


Val 


Val 


Ser 


Gin 


Leu 


Leu 


Arg He 


Pro 


Gin 


Ala 


Val 




99 








165 










170 








175 






101 gtg 


gac 


atg 


gtg 


gcg 


ggg 


qcc 

ZJ 


cat 


tqq 


qqa 


gtc ctg 


qcq 


qqc 

ZZ> ZJ 


etc 


gec 


576 


102 Val 


Asp 


Met 


Val 


Ala 


Gly 


Ala 


His 


Trp 


Gly 


Val Leu 


Ala 


Gly 


Leu 


Ala 




103 






180 










185 








190 








105 tac 


tat 


tec 


atg 


gtg 


ggg 


aac 


tgg 


get 


aaq 


gtt ttg 


att 


qtq 


atq 


eta 


624 


106 Tyr 


Tyr 


Ser 


Met 


Val 


Gly 


Asn 


Trp 


Ala 


Lys 


Val Leu 


He 


Val 


Met 


Leu 




107 




195 










200 








205 










109 etc 


ttt 


get 


etc 


taatag 




















642 


110 Leu 


Phe 


Ala 


Leu 


























111 


210 






























114 <210> SEQ ID NO 


: 4 
























116 <211> LENGTH: 212 
























117 <212> TYPE: 


PRT 


























118 <213> ORGANISM: 


Hepatitis C 


virus 
















121 <400> SEQUENCE: 


4 
























123 Met 


Pro 


Gly 


Cys 


Ser 


Phe 


Ser 


He 


Phe 


Leu 


Leu Ala 


Leu 


Leu 


Ser 


Cys 




124 1 








5 










10 








15 






12 6 Leu 


Thr 


He 


Pro 


Ala 


Ser 


Ala 


Tyr 


Glu 


Val 


Arg Asn 


Val 


Ser 


Gly 


Met 




127 






20 










25 








30 








129 Tyr 


His 


Val 


Thr 


Asn 


Asp 


Cys 


Ser 


Asn 


Ser 


Ser He 


Val 


Tyr 


Glu 


Ala 




130 




35 










40 








45 










132 Ala 


Asp 


Met 


He 


Met 


His 


Thr 


Pro 


Gly 


Cys 


Val Pro 


Cys 


Val 


Arg 


Glu 




133 


50 










55 








60 












135 Asn 


Asn 


Ser 


Ser 


Arg 


Cys 


Trp 


Val 


Ala 


Leu 


Thr Pro 


Thr 


Leu 


Ala 


Ala 




136 65 










70 










75 








80 




138 Arg 


Asn 


Ala 


Ser 


Val 


Pro 


Thr 


Thr 


Thr 


He 


Arg Arg 


His 


Val 


Asp 


Leu 




139 








85 










90 








95 






141 Leu 


Val 


Gly 


Ala 


Ala 


Ala 


Leu 


Cys 


Ser 


Ala 


Met Tyr 


Val 


Gly 


Asp 


Leu 
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142 




100 










105 










110 






144 


Cys Gly Ser 


Val 


Phe 


Leu 


Val 


Ser 


Gin 


Leu 


Phe 


Thr 


He 


Ser 


Pro 


Arg 


145 


115 










120 










125 








147 


Arg His Glu 


Thr 


Val 


Gin 


Asp 


Cys 


Asn 


Cys 


Ser 


He 


Tyr 


Pro 


Gly 


His 


148 


130 








135 










140 










150 


lie Thr Gly 


His 


Arg 


Met 


Ala 


Trp 


Asp 


Met 


Met 


Met 


Asn 


Trp 


Ser 


Pro 


151 


145 






150 










155 










160 


153 


Thr Thr Ala 


Leu 


Val 


Val 


Ser 


Gin 


Leu 


Leu 


Arg 


He 


Pro 


Gin 


Ala 


Val 


154 






165 










170 










175 




156 


Val Asp Met 


Val 


Ala 


Gly 


Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly 


Leu 


Ala 


157 




180 










185 










190 






159 


Tyr Tyr Ser 


Met 


Val 


Gly 


Asn 


Trp 


Ala 


Lys 


Val 


Leu 


He 


Val 


Met 


Leu 


160 


195 










200 










205 








162 


Leu Phe Ala 


Leu 


























163 


210 




























166 


<210> SEQ ID NO 


5 
























168 


<211> LENGTH: 795 
























169 


<212> TYPE: 


DNA 


























170 


<213> ORGANISM: 


Hepatitis C 


virus 
















172 


<220> FEATURE: 


























173 


<221> NAME/KEY: 


CDS 
























174 


<222> LOCATION: 


1. . 


792 






















176 


<220> FEATURE: 


























177 


<221> NAME/KEY: 


mat. 


_peptide 




















178 


<222> LOCATION: 


1. . 


789 






















180 


<400> SEQUENCE: 


5 
























182 


atg ttg ggt 


aag 


gtc 


ate 


gat 


acc 


ctt 


aca 


tgc 


ggc 


ttc 


gcc 


gac 


etc 


183 


Met Leu Gly 


Lys 


Val 


He 


Asp 


Thr 


Leu 


Thr 


Cys 


Gly 


Phe 


Ala 


Asp 


Leu 


184 


1 




5 










10 










15 




186 


gtg ggg tac 


att 


ccg 


etc 


gtc 


ggc 


gcc 


ccc 


eta 


ggg 


ggc 


get 


gcc 


agg 


187 


Val Gly Tyr 


He 


Pro 


Leu 


Val 


Gly 


Ala 


Pro 


Leu 


Gly Gly 


Ala 


Ala 


Arg 


188 




20 










25 










30 






190 


gcc ctg gcg 


cat 


ggc 


gtc 


egg 


gtt 


ctg 


gag 


gac 


ggc 


gtg 


aac 


tat 


gca 


191 


Ala Leu Ala 


His 


Gly 


Val 


Arg 


Val 


Leu 


Glu 


Asp 


Gly Val 


Asn 


Tyr 


Ala 


192 


35 










40 










45 








194 


aca ggg aat 


ttg 


ccc 


ggt 


tgc 


tct 


ttc 


tct 


ate 


ttc 


etc 


ttg 


get 


ttg 


195 


Thr Gly Asn 


Leu 


Pro 


Gly 


Cys 


Ser 


Phe 


Ser 


He 


Phe 


Leu 


Leu 


Ala 


Leu 


196 


50 








55 










60 










198 


ctg tec tgt 


ctg 


acc 


gtt 


cca 


get 


tec 


get 


tat 


gaa 


gtg 


cgc 


aac 


gtg 


199 


Leu Ser Cys 


Leu 


Thr 


Val 


Pro 


Ala 


Ser 


Ala 


Tyr 


Glu 


Val 


Arg 


Asn 


Val 


200 


65 






70 










75 










80 


202 


tec ggg atg 


tac 


cat 


gtc 


acg 


aac 


gac 


tgc 


tec 


aac 


tea 


age 


att 


gtg 


203 


Ser Gly Met 


Tyr 


His 


Val 


Thr 


Asn 


Asp 


Cys 


Ser 


Asn 


Ser 


Ser 


He 


Val 


204 






85 










90 










95 




206 


tat gag gca 


gcg 


gac 


atg 


ate 


atg 


cac 


acc 


ccc 


ggg 


tgc 


gtg 


ccc 


tgc 


207 


Tyr Glu Ala 


Ala 


Asp 


Met 


He 


Met 


His 


Thr 


Pro 


Gly 


Cys 


Val 


Pro 


Cys 


208 




100 










105 










110 






210 


gtt egg gag 


aac 


aac 


tct 


tec 


cgc 


tgc 


tgg 


gta 


gcg 


etc 


acc 


ccc 


acg 


211 


Val Arg Glu 


Asn 


Asn 


Ser 


Ser 


Arg 


Cys 


Trp 


Val 


Ala 


Leu 


Thr 


Pro 


Thr 



48 



96 



144 



192 



240 



288 



336 



384 
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212 






115 










120 










125 










214 


etc 


gca 


get 


agg 


aac 


gec 


age 


gtc 


ccc 


ace 


acg 


aca 


ata 


cga 


cgc 


cac 


432 


215 


Leu 


Ala 


Ala 


Arg 


Asn 


Ala 


Ser 


Val 


Pro 


Thr 


Thr 


Thr 


He 


Arg 


Arg 


His 




216 




130 










135 










140 












218 


gtc 


gat 


ttg 


etc 


gtt 


9" 9" 9" 


gcg 


get 


get 


ttc 


tgt 


tec 


get 


atg 


tac 


gtg 


480 


219 


Val 


Asp 


Leu 


Leu 


Val 


Gly 


Ala 


Ala 


Ala 


Phe 


Cys 


Ser 


Ala 


Met 


Tyr 


Val 




220 


145 










150 










155 










160 




222 


9" 9" 9" 


gac 


etc 


tgc 


gga 


tct 


gtc 


ttc 


etc 


gtc 


tec 


cag 


ctg 


ttc 


ace 


ate 


528 


223 


Gly 


Asp 


Leu 


Cys 


Gly 


Ser 


Val 


Phe 


Leu 


Val 


Ser 


Gin 


Leu 


Phe 


Thr 


He 




224 










165 










170 










175 






225 


teg 


cct 


cgc 


egg 


cat 


gag 


acg 


gtg 


cag 


gac 


tgc 


aat 


tgc 


tea 


ate 


tat 


576 


226 


Ser 


Pro 


Arg 


Arg 


His 


Glu 


Thr 


Val 


Gin 


Asp 


Cys 


Asn 


Cys 


Ser 


He 


Tyr 




227 








180 










185 










190 








229 


ccc 


ggc 


cac 


ata 


acg 


ggt 


cac 


cgt 


atg 


get 


tgg 


gat 


atg 


atg 


atg 


aac 


624 


230 


Pro 


Gly 


His 


He 


Thr 


Gly 


His 


Arg 


Met 


Ala 


Trp 


Asp 


Met 


Met 


Met 


Asn 




231 






195 










200 










205 










233 


tgg 


teg 


cct 


aca 


acg 


gee 


ctg 


gtg 


gta 


teg 


cag 


ctg 


etc 


egg 


ate 


cca 


672 


234 


Trp 


Ser 


Pro 


Thr 


Thr 


Ala 


Leu 


Val 


Val 


Ser 


Gin 


Leu 


Leu 


Arg 


He 


Pro 




235 




210 










215 










220 












237 


caa 


get 


gtc 


gtg 


gac 


atg 


gtg 


gcg 


ggg 


gec 


cat 


tgg 


gga 


gtc 


ctg 


gcg 


720 


238 


Gin 


Ala 


Val 


Val 


Asp 


Met 


Val 


Ala 


Gly 


Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 




239 


225 










230 










235 










240 




241 


ggt 


etc 


gec 


tac 


tat 


tec 


atg 


gtg 


ggg 


aac 


tgg 


get 


aag 


gtt 


ttg 


att 


768 


242 


Gly 


Leu 


Ala 


Tyr 


Tyr 


Ser 


Met 


Val 


Gly Asn 


Trp 


Ala 


Lys 


Val 


Leu 


He 




243 










245 










250 










255 






245 


gtg 


atg 


eta 


etc 


ttt 


get 


ccc 


taatag 
















795 


246 


Val 


Met 


Leu 


Leu 


Phe 


Ala 


Pro 






















247 








260 




























248 


<210> SEQ ID NO: 


; 6 


























250 


<211> LENGTH: 263 


























251 


<212> TYPE: 


PRT 




























252 


<213> ORGANISM: 


Hepatitis C 


virus 


















255 


<400> SEQUENCE: 


6 


























257 


Met 


Leu 


Gly 


Lys 


Val 


He 


Asp 


Thr 


Leu 


Thr 


Cys 


Gly 


Phe 


Ala 


Asp 


Leu 




258 


1 








5 










10 










15 






260 


Val 


Gly 


Tyr 


He 


Pro 


Leu 


Val 


Gly 


Ala 


Pro 


Leu 


Gly 


Gly 


Ala 


Ala 


Arg 




261 








20 










25 










30 








263 


Ala 


Leu 


Ala 


His 


Gly 


Val 


Arg 


Val 


Leu 


Glu 


Asp 


Glv 


Val 


Asn 


Tvr 


Ala 




264 






35 










40 










45 










266 


Thr 


Gly 


Asn 


Leu 


Pro 


Gly 


Cys 


Ser 


Phe 


Ser 


He 


Phe 


Leu 


Leu 


Ala 


Leu 




267 




50 










55 










60 












269 


Leu 


Ser 


Cys 


Leu 


Thr 


Val 


Pro 


Ala 


Ser 


Ala 


Tyr 


Glu 


Val 


Arg 


Asn 


Val 




270 


65 










70 










75 










80 




272 


Ser 


Gly 


Met 


Tyr 


His 


Val 


Thr 


Asn 


Asp 


Cys 


Ser 


Asn 


Ser 


Ser 


He 


Val 




273 










85 










90 










95 






275 


Tyr 


Glu 


Ala 


Ala 


Asp 


Met 


He 


Met 


His 


Thr 


Pro 


Gly 


Cys 


Val 


Pro 


Cys 




276 








100 










105 










110 








278 


Val 


Arg 


Glu 


Asn 


Asn 


Ser 


Ser 


Arg 


Cys 


Trp 


Val 


Ala 


Leu 


Thr 


Pro 


Thr 




279 






115 










120 










125 
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281 


Leu 


Ala 


Ala 


Arg 


Asn 


Ala 


Ser 


Val 


Pro 


Thr 


Thr 


Thr 


He 


Arg 


Arg 


His 


282 




130 










135 










140 










284 


Val 


Asp 


Leu 


Leu 


Val 


Gly 


Ala 


Ala 


Ala 


Phe 


Cys 


Ser 


Ala 


Met 


Tyr 


Val 


285 


145 










150 










155 










160 


287 


Gly 


Asp 


Leu 


Cys 


Gly 


Ser 


Val 


Phe 


Leu 


Val 


Ser 


Gin 


Leu 


Phe 


Thr 


He 


288 










165 










170 










175 




290 


Ser 


Pro 


Arg 


Arg 


His 


Glu 


Thr 


Val 


Gin 


Asp 


Cys 


Asn 


Cys 


Ser 


He 


Tyr 


291 








180 










185 










190 






293 


Pro 


Gly 


His 


He 


Thr 


Gly 


His 


Arg 


Met 


Ala 


Trp 


Asp 


Met 


Met 


Met 


Asn 


294 






195 










200 










205 








296 


Trp 


Ser 


Pro 


Thr 


Thr 


Ala 


Leu 


Val 


Val 


Ser 


Gin 


Leu 


Leu 


Arg 


He 


Pro 


297 




210 










215 










220 










299 


Gin 


Ala 


Val 


Val 


Asp 


Met 


Val 


Ala 


Gly 


Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 


300 


225 










230 










235 










240 


302 


Gly 


Leu 


Ala 


Tyr 


Tyr 


Ser 


Met 


Val 


Gly 


Asn 


Trp 


Ala 


Lys 


Val 


Leu 


He 


303 










245 










250 










255 




305 


Val 


Met 


Leu 


Leu 


Phe 


Ala 


Pro 




















306 








260 


























309 


<210> SEQ ID NO 


7 
























311 


<211> LENGTH: 633 
























312 


<212> TYPE: 


DNA 


























313 


<213> ORGANISM: 


Hepatitis C 


virus 
















315 


<220> FEATURE: 


























316 


<221> NAME/KEY: 


CDS 
























317 


<222> LOCATION: 


1. . 


530 






















319 


<220> FEATURE: 


























320 


<221> NAME/KEY: 


mat. 


_peptide 




















321 


<222> LOCATION: 


1. . 


527 






















323 


<400> SEQUENCE: 


7 
























325 


atg 


ttg 


ggt 


aag 


gtc 


ate 


gat 


acc 


ctt 


acg 


tgc 


ggc 


ttc 


gcc 


gac 


etc 


326 


Met 


Leu 


Gly 


Lys 


Val 


He 


Asp 


Thr 


Leu 


Thr 


Cys 


Gly 


Phe 


Ala 


Asp 


Leu 


327 


1 








5 










10 










15 




329 


atg 


ggg 


tac 


att 


ccg 


etc 


gtc 


ggc 


gcc 


ccc 


eta 


ggg 


ggt 


get 


gcc 


aga 


330 


Met 


Gly 


Tyr 


He 


Pro 


Leu 


Val 


Gly 


Ala 


Pro 


Leu 


Gly 


Gly 


Ala 


Ala 


Arg 


331 








20 










25 










30 






333 


gcc 


ctg 


gcg 


cat 


ggc 


gtc 


egg 


gtt 


ctg 


gaa 


gac 


ggc 


gtg 


aac 


tat 


gca 


334 
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